Search CORE

5 research outputs found

Efficient storage of versioned matrices

Author: Seering Adam B
Publication venue: Massachusetts Institute of Technology
Publication date: 01/01/2011
Field of study

Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2011.This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections.Cataloged from student submitted PDF version of thesis.Includes bibliographical references (p. 95-96).Versioned-matrix storage is increasingly important in scientific applications. Various computer-based scientific research, from astronomy observations to weather predictions to mechanical finite-element analyses, results in the generation of large matrices that must be stored and retrieved. Such matrices are often versioned; an initial matrix is stored, then a subsequent matrix based on the first is produced, then another subsequent matrix after that. For large databases of matrices, available disk storage can be a substantial constraint. I propose a framework and programming interface for storing such versioned matrices, and consider a variety of intra-matrix and inter-matrix approaches to data storage and compression, taking into account disk-space usage, performance for inserting data, and performance for retrieving data from the database. For inter-matrix "delta" compression, I explore and compare several differencing algorithms, and several means of selecting which arrays are differenced against each other, with the aim of optimizing both disk-space usage and insert and retrieve performance. This work shows that substantial disk-space savings and performance improvements can be achieved by judicious use of these techniques. In particular, a combination of Lempel-Ziv compression and a proposed form of delta compression, it is possible to both decrease disk usage by a factor of 10 and increase query performance for a factor of two or more, for particular data sets and query workloads. Various other strategies can dramatically improve query performance in particular edge cases; for example, a technique called "chunking", where a matrix is broken up and saved as several files on disk, can cause query runtime to be approximately linear in the amount of data requested rather than the size of the raw matrix on disk.by Adam B. Seering.M.Eng

DSpace@MIT

Efficient Versioning for Scientific Array Databases

Author: Cudre-Mauroux Philippe
Madden Samuel R.
Seering Adam
Stonebraker Michael
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/04/2012
Field of study

In this paper, we describe a versioned database storage manager we are developing for the SciDB scientific database. The system is designed to efficiently store and retrieve array-oriented data, exposing a "no-overwrite" storage model in which each update creates a new "version" of an array. This makes it possible to perform comparisons of versions produced at different times or by different algorithms, and to create complex chains and trees of versions. We present algorithms to efficiently encode these versions, minimizing storage space while still providing efficient access to the data. Additionally, we present an optimal algorithm that, given a long sequence of versions, determines which versions to encode in terms of each other (using delta compression) to minimize total storage space or query execution cost. We compare the performance of these algorithms on real world data sets from the National Oceanic and Atmospheric Administration (NOAA), Open Street Maps, and several other sources. We show that our algorithms provide better performance than existing version control systems not optimized for array data, both in terms of storage size and access time, and that our delta-compression algorithms are able to substantially reduce the total storage space when versions exist with a high degree of similarity.National Science Foundation (U.S.) (Grant IIS/III-1111371)National Science Foundation (U.S.) (Grant SI2-1047955

Crossref

DSpace@MIT

pyodide/pyodide: 0.24.0

Author: Adam Seering
Alexey Ignatiev
Bart Broere
casatir
Chris Trevino
Christian Clauss
Deepak Cherian
Dexter Chua
Grimmer Kang
Gyeongjae Choi
Henry Schreiner
Hood Chatham
Jan Max Meyer
Jason Stafford
Jo Bovy
Joe Marshall
Kai Tietz
Loïc Estève
Madhur Tandon
Marc Abramowitz
Michael Droettboom
Michael Greminger
Qijia Liu
Roman Yurchak
Seungmin Kim
Tomas R
Wei Ouyang
Will Lachance
Publication venue
Publication date: 13/09/2023
Field of study

Pyodide is a Python distribution for the browser and Node.js based on WebAssembl

ZENODO

Perioperative pain management for shoulder surgery: evolving techniques

Author: Abdallah
Abildgaard
Adam
Aguirre
Ahn
Al-Kaisy
Alfuth
Aliste
Assareh
Attardi
Auyong
Auyong
Axelsson
Aydogan
Bang
Barber
Behr
Bengisun
Bertolini
Bishop
Bjornholdt
Blomquist
Boddu
Bojaxhi
Borgeat
Boyer
Cabaton
Campbell
Chalifoux
Chalmers
Cheah
Chen
Chou
Cicero
Codding
Constantinescu
Culebras
Cummings
Dahl
Dambros
De Cosmo
Demir
Desmet
Desroches
Dimmen
Dimmen
Divella
Doleman
Dong
Dowell
Dwyer
Elbahrawy
Eskandar
Faria-Silva
Farley
Faunø
Ford
Fredrickson
Fritsch
Gallipani
Gibbons
Gil
Gillis
Glod
Goebel
Gomide
Graham
Guay
Hah
Hamal
Han
Hannan
Hasan
Hernandez-Boussard
Hoe-Hansen
Inderhaug
Jackson
Jadon
Jarde
Jibril
Jin
Jung
Kang
Kang
Karaman
Kawabata
Kawanishi
Kawasaki
Kehlet
Kelly
Khalili
Khetarpal
Kim
Kim
Koh
Koltka
Kopacz
Kraeutler
Kullenberg
Kuyucu
Langford
Leas
Lee
Leegwater
Lemay
Leroux
Levy
Lim
Lovecchio
Malhotra
Mallet
Manchikanti
Manyande
Mardani-Kivi
Montazeri
Morris
Morsi
Namdari
Namdari
Nelson
Noyes
O'Neal
Oh
Okoroha
Osbahr
Overton
Panchamia
Patel
Patterson
Piana
Politi
Price
Psaty
Rao
Robinson
Rodgers
Rouhani
Routman
Ryu
Rø
Sabesan
Sabesan
Saito
Sakae
Salmon
Schubert
Scott
Seering
Sethi
Shah
Sicard
Sills
Simon
Sinatra
Singelyn
Singh
Singla
Sjöling
Smith
Smith
Soulioti
Speer
Spence
Sripada
Stepan
Stiller
Stundner
Stålman
Suzuki
Syed
Szeverenyi
Tandoc
Teerawattananon
Tetzlaff
Thienpont
Toivonen
Tokish
Trabelsi
Vandepitte
Vieira
Warrender
Watanabe
Webb
Weller
Welton
Wiesmann
Williams
Wilson
Wong
Woo
Woolf
Yajnik
Yu
Yu
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref